178 research outputs found

    Decomposition of linear tensor transformations

    Full text link
    One of the main issues in computing a tensor decomposition is how to choose the number of rank-one components, since there is no finite algorithms for determining the rank of a tensor. A commonly used approach for this purpose is to find a low-dimensional subspace by solving an optimization problem and assuming the number of components is fixed. However, even though this algorithm is efficient and easy to implement, it often converges to poor local minima and suffers from outliers and noise. The aim of this paper is to develop a mathematical framework for exact tensor decomposition that is able to represent a tensor as the sum of a finite number of low-rank tensors. In the paper three different problems will be carried out to derive: i) the decomposition of a non-negative self-adjoint tensor operator; ii) the decomposition of a linear tensor transformation; iii) the decomposition of a generic tensor.Comment: arXiv admin note: text overlap with arXiv:2305.0280

    Diagonal Kernel Point Estimation of nth-Order Discrete Volterra-Wiener Systems

    Get PDF
    The estimation of diagonal elements of a Wiener model kernel is a well-known problem. The new operators and notations proposed here aim at the implementation of efficient and accurate nonparametric algorithms for the identification of diagonal points. The formulas presented here allow a direct implementation of Wiener kernel identification up to the th order. Their efficiency is demonstrated by simulations conducted on discrete Volterra systems up to fifth order

    Learning HMM State Sequences from Phonemes for Speech Synthesis

    Get PDF
    AbstractThis paper presents a technique for learning hidden Markov model (HMM) state sequences from phonemes, that combined with modified discrete cosine transform (MDCT), is useful for speech synthesis. Mel-cepstral spectral parameters, currently adopted in the conventional methods as features for HMM acoustic modeling, do not ensure direct speech waveforms reconstruction. In contrast to these approaches, we use an analysis/synthesis technique based on MDCT that guarantees a perfect reconstruction of the signal frame feature vectors and allows for a 50% overlap between frames without increasing the data rate. Experimental results show that the spectrograms achieved with the suggested technique behave very closely to the original spectrograms, and the quality of synthesized speech is conveniently evaluated using the well known Itakura-Saito measure

    ECG-Based Arrhythmia Classification using Recurrent Neural Networks in Embedded Systems

    Get PDF
    Cardiac arrhythmia is one of the most important cardiovascular diseases (CVDs), causing million deaths every year. Moreover it is difficult to diagnose because it occurs intermittently and as such requires the analysis of large amount of data, collected during the daily life of patients. An important tool for CVD diagnosis is the analysis of electrocardiogram (ECG), because of its non-invasive nature and simplicity of acquisition. In this work we propose a classification algorithm for arrhythmia based on recurrent neural networks (RNNs) that operate directly on ECG data, exploring the effectiveness and efficiency of several variations of the general RNN, in particular using different types of layers implementing the network memory. We use the MIT-BIH arrhythmia database and the evaluation protocol recommended by the Association for the Advancement of Medical Instrumentation (AAMI). After designing and testing the effectiveness of the different networks, we then test its porting to an embedded platform, namely the STM32 microcontroller architecture from ST, using a specific framework to port a pre-built RNN to the embedded hardware, convert it to optimized code for the platform and evaluate its performance in terms of resource usage. Both in binary and multiclass classification, the basic RNN model outperforms the other architectures in terms of memory storage (∼117 KB), number of parameters (∼5 k) and inference time (∼150 ms), while the RNN LSTM-based achieved the best accuracy (∼90%)

    an acquisition system of in house parameters from wireless sensors for the identification of an environmental model

    Get PDF
    Abstract This paper presents a system for the acquisition of in-house parameters, such as temperature, pressure, humidity and so on, that can be used for the intelligent control of a building. The main objective of this work is to determine an environmental model of an in-house room using machine learning techniques. The system is based on a low data-rate network of sensing and control nodes to acquire the data, realized with a new protocol, called ToLHnet, that is able to employ both wired and wireless communication on different media. Several standard machine learning techniques, namely linear regression, classification and regression tree algorithm, support vector machine, have been used for the regression of the input-output thermal model. Additionally, a recently proposed new technique named particle-Bernstein polynomial has been successfully applied. Experimental results show that this technique outperforms the previous techniques, for both accuracy and computation time

    a comparative study of machine learning algorithms for physiological signal classification

    Get PDF
    Abstract The present work aims at the evaluation of the effectiveness of different machine learning algorithms on a variety of clinical data, derived from small, medium, and large publicly available databases. To this end, several algorithms were tested, and their performance, both in terms of accuracy and time required for the training and testing phases, are here reported. Sometimes a data preprocessing phase was also deemed necessary to improve the performance of the machine learning procedures, in order to reduce the problem size. In such cases a detailed analysis of the compression strategy and results is also presented

    A Lightweight CNN-Based Vision System for Concrete Crack Detection on a Low-Power Embedded Microcontroller Platform

    Get PDF
    The detection of cracks is a key aspect for assessing the condition of in-service structures such as road, bridges or dams. Therefore it assumes a great importance for civil infrastructures monitoring, road maintenance and traffic safety. Intelligent detection methods based on convolutional neural networks (CNNs) have been widely applied to the field of crack detection in recently years. In this work we propose a vision system based on two lightweight and accurate CNN models implemented in a low-cost, low-power platform, namely the OpenMV Cam H7 Plus, to monitor and to detect concrete cracks in real-time, suitable to realize a prototype of early warning system. In order to be useful, such a system must provide a very high accuracy, so as not to give false alarms, and be parsimonious enough on computational resources to be embedded into low-power, portable systems that can be deployed on the field. To reach this goal, firstly we analyze different state-of-the-art CNNs applied to the concrete crack detection task in order to discover the smallest network in terms of memory storage and number of parameters. Then, we compare the performance, in terms of memory occupancy and accuracy, of the proposed CNN architectures with the smallest network in the investigated literature, LeNet, all trained on two different image datasets, the Concrete Crack Images for Classification dataset and the SDNET2018 dataset, and implemented on the embedded system OpenMV Cam H7 Plus. The proposed CNN architectures perform nicely on this platform, using only a small fraction, between 6% to 26%, of the memory required by LeNet, and always providing better accuracy in all the tested cases and on both the datasets tried, with only a marginal increase in inference time

    Embedded AM-FM Signal Decomposition Algorithm for Continuous Human Activity Monitoring

    Get PDF
    AM-FM decomposition techniques have been successfully used for extracting significative features from a large variety of signals, helping realtime signal monitoring and pattern recognition, since they represent signals as a simultaneous composition of amplitude modulation and frequency modulation, where the carriers, amplitude envelopes, and the instantaneous frequencies are the features to be estimated. Human activities often involve repetitive movements, such as in running or cycling, where sinusoidal AM-FM decompositions of signals have already demonstrated to be useful to extract compact features to aid monitoring, classification, or detection. In this work we thus present the challenges and results of implementing the iterated coherent Hilbert decomposition (ICHD), a particularly effective algorithm to obtain an AM-FM decomposition, within a resource-constrained and low-power ARM Cortex-M4 microcontroller that is present in a wearable sensor we developed. We apply ICHD to the gyroscope data acquired from an inertial measurement unit (IMU) that is present in the sensor. Optimizing the implementation allowed us to achieve real-time performance using less then 16 % of the available CPU time, while consuming only about 5.4 mW of power, which results in a run-time of over 7 days using a small 250 mAh rechargeable cell

    robust speaker identification in a meeting with short audio segments

    Get PDF
    The paper proposes a speaker identification scheme for a meeting scenario, that is able to answer the question "is somebody currently talking?", if yes, "who is it?". The suggested system has been designed to identify during a meeting conversation the current speaker from a set of pre-trained speaker models. Experimental results on two databases show the robustness of the approach to the overlapping phenomena and the ability of the algorithm to correctly identify a speaker with short audio segments
    corecore